Discovering Geographic Knowledge in Data Rich Environments
نویسنده
چکیده
1 EXECUTIVE SUMMARY Similar to many scientific and applied research fields, geography has moved from a data-poor and computation-poor to a data-rich and computation-rich environment. The scope, coverage and volume of digital geographic datasets are growing rapidly due to new high-resolution satellite systems, initiatives such as the U.S. National Spatial Data Infrastructure and the automated collection of point-of-sale, logistic and behavioral data. In addition, the types of geographic data collected are expanding from the traditional vector and raster data models to include georeferenced multimedia data. This trend is likely to continue if not accelerate in the foreseeable future. Traditional spatial statistical and spatial analytical methods were developed in an era when data collection was expensive and computational power was weak. The increasing volume and diverse nature of digital geographic data easily overwhelm mainstream spatial analysis techniques that are oriented towards teasing scarce information from small and homogenous datasets. Traditional statistical methods, particularly spatial statistics, have high computational burdens. These techniques are confirmatory and require the researcher to have a priori hypotheses. Therefore, traditional spatial analytical techniques cannot easily discover new and unexpected patterns, trends and relationships that can be hidden deep within very large and diverse geographic datasets. The National Center for Geographic Information and Analysis (NCGIA) – Project Varenius workshop on " Discovering geographic knowledge in data-rich environments " brought together a diverse group of stakeholders with interests in developing and applying new techniques for exploring large and diverse geographic datasets. This included geographers, geographic information scientists, computer scientists and statisticians. The synergy created by the discussions prior to, during and after the three-day workshop resulted in the identification of research priorities and directions for continued development of " geographic knowledge discovery " (GKD) theory and techniques. This research report summarizes the activities surrounding the Project Varenius workshop on " Geographic knowledge discovery in data-rich environments. " Section 1 comprises an overview of the workshop. Section 2 lists the workshop participants. Section 3 provides the position papers submitted in response to the open Call for Participation. Section 4 summarizes the workshop presentations and discussion. Many detailed recommendations for continued research and development in GKD theory and techniques emerged during the three-day workshop. Participants also identified several cross-cutting research issues. These issues are: • What are the new questions for geographic research? A fundamental question for the geographic and related research communities is " What questions do we want to …
منابع مشابه
Mining Census and Geographic Data in Urban Planning Environments
Urban planning is a knowledge-intensive activity. It acquires useful knowledge by analysing jointly socio-economic data and topographic maps. When supported by computers, this preliminary information gathering triggers a knowledge discovery process, which often consists of exploratory data analysis tasks. Advances in georeferencing have caused among the other things a growing demand for more po...
متن کاملBuilding new spatial interaction models by using genetic programming and a supercomputer
An application of a parallel genetic programming approach to discovering new spatial interaction models is described. It is noted that geographic information systems have resulted in the creation of extremely data-rich environments but it is proving very difficult to exploit this situation because of the lack of suitable models and the difficulties of model creation by more traditional hypothet...
متن کاملDiscovering Knowledge from High-Dimensional Geographic Data: Integrating Visual and Computational Approaches
It has been widely recognized that spatial data analysis capabilities have not kept up with the need for analyzing the increasingly large volumes of geographic data of various themes that are currently being collected and archived (Openshaw 1991; Miller and Han 2001; Shekhar, Vatsaval et al. 2002; Guo 2003; Guo, Peuquet et al. 2003; Muntz, Barclay et al. 2003). On one hand, such a wealth of dat...
متن کاملMining Multivariate Associations within GIS Environments
As geospatial data grows explosively, needs for the incorporation of data mining techniques into Geographic Information Systems (GISs) are in great demand. Association rules mining is a core technique in data mining and is a solid candidate for the cause-effect analysis of large geospatial databases. It efficiently detects frequent asymmetric causal patterns in large databases. In this paper, w...
متن کاملClassification and Comparison of Methods for Discovering Coverage Loss Areas in Wireless Sensor Networks
In recent years, wireless sensor networks data is taken into consideration as an ideal source, in terms of speed, accuracy and cost, in order to study the Earth's surface. One of the most important challenges in this area, is the signaling network coverage and finding holes. In recent years, wireless sensor networks data is taken into consideration as an ideal source, in terms of speed, accurac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999